Volume 1, Issue 2, May 2021
Research PaperOpenAccess
Federated learning for privacy-preserving data access
*Malgorzata Smietanka^1, Hirsh Pithadia² and Philip Treleaven³**
¹University College London, Gower St, Bloomsbury, London WC1E 6BT, United Kingdom. E-mail: malgorzata.wasiewicz.17@ucl.ac.uk ²University College London, Gower St, Bloomsbury, London WC1E 6BT, United Kingdom. E-mail: H.Pithadia@cs.ucl.ac.uk ³University College London, Gower St, Bloomsbury, London WC1E 6BT, United Kingdom. E-mail: p.treleaven@ucl.ac.uk *Corresponding Author
Int.J.Data.Sci. and Big Data Anal. 1(2) (2021) 1-13, DOI: https://doi.org/10.51483/IJDSBDA.1.2.2021.1-13
Received: 20/12/2020\|Accepted: 13/04/2021\|Published: 05/05/2021

Abstract

Federated learning is a pioneering privacy-preserving data technology and also a new machine learning model trained on distributed data sets. Companies collect huge amounts of historic and real-time data to drive their business and collaborate with other organizations. However, data privacy is becoming increasingly important because of regulations (e.g., EU GDPR) and the need to protect their sensitive and personal data. Companies need to manage data access: firstly within their organizations (so they can control staff access), and secondly protecting raw data when collaborating with third parties. What is more, companies are increasingly looking to ‘monetize’ the data they’ve collected. However, under new legislations, utilizing data by different organization is becoming increasingly difficult (Yu,2016). Federated learning pioneered by Google is the emerging privacy- preserving data technology and also a new class of distributed machine learning models. This paper discusses federated learning as a solution for privacy-preserving data access and distributed machine learning applied to distributed data sets. It also presents a privacy-preserving federated learning infrastructure.

Keywords: Federated Learning, Privacy, Machine Learning, Data Science, InsurTech

Full text

Download

International Journal of Data Science and Big Data Analytics

Federated learning for privacy-preserving data access

Abstract